B(eo)W(u)LF: Facilitating recurrence analysis on multi-level language

نویسندگان

  • Alexandra Paxton
  • Rick Dale
چکیده

Discourse analysis may seek to characterize not only the overall composition of a given text or corpora but also the dynamic patterns within the data. Patterns of interest may occur at multiple levels, from character to sentence to corpus. Researchers may be interested in the way that sentence structures recur between participants or how affect words cluster in a single text. Recurrence analyses are an ideal tool for such investigations, but linguistic data must often be transformed prior to being analyzed. This technical report introduces a data format called the by-­‐word long-­‐form or B(eo)W(u)LF. Inspired by the long-­‐form data format required for mixed-­‐effects modeling, B(eo)W(u)LF structures linguistic data into an expanded matrix encoding any number of researchers-­‐specified markers. While we do not necessarily claim to be the first to use methods along these lines, we have created a series of tools utilizing Python and MATLAB to enable such discourse analyses. We demonstrate this analysis on 319 lines of the Old English epic poem, Beowulf, translated into modern English (Appendix 1). At the end of this report, we provide the original text, scripts adapted to the text, and (for brevity's sake) a portion of the final result. The sample text is a single file, but if a corpus is saved in multiple individual files, these scripts can be modified with " if " statements to streamline the process. Text Preparation These scripts require that text data be stored in a plain text file (.txt or .csv). If the corpus comprises separate text files, it's highly recommended that each file be transcribed or formatted identically. The data will be run through with a series of regular expressions in Python to quickly and automatically reformat the text (Appendix 2). Therefore, if individual files are formatted differently, the cleanup script will have to be tweaked for each format type. The analyses to be performed will dictate the cleanup choices. In our example, we remove commas, colons, and semicolons, but we are interested in keeping the end-­‐ of-­‐sentence punctuations (e.g., periods, question marks). We may choose to do

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effect of Zataria multiflora Essential Oil on Histamine Production in Iranian Salted-Fermented Fish Sauce (Mahyaveh)

Background: Mahyaveh is an Iranian salted-fermented fish sauce which due to its high amount of protein has risk of histamine production. This study was carried out to determine effect of Zataria multiflora Essential Oil (EO) on histamine production in mahyaveh. Methods: Dried anchovies (Stolephorus sp.), refined-salt and mustard seed (Brassica juncea) were purchased from the local market in ...

متن کامل

What about the employees in entrepreneurial firms? A multi-level analysis of the relationship between entrepreneurial orientation, role ambiguity, and social support

Research on entrepreneurial orientation (EO) has mainly addressed outcomes of EO at the level of the firm. However, few studies have examined how EO affects employees. Using a multi-level analysis of 343 employees nested in 25 SMEs, revealed that EO will increase the degree of role ambiguity among employees. Social support from management was not found to have any effect on the relationship bet...

متن کامل

What about the employees in entrepreneurial firms? A multi-level analysis of the relationship between entrepreneurial orientation, role ambiguity, and social support

Research on entrepreneurial orientation (EO) has mainly addressed outcomes of EO at the level of the firm. However, few studies have examined how EO affects employees. Using a multi-level analysis of 343 employees nested in 25 SMEs, revealed that EO will increase the degree of role ambiguity among employees. Social support from management was not found to have any effect on the relationship bet...

متن کامل

Effect of a Childbirth Psychoeducation Program on the Level of Fear of Childbirth in Primigravid Women

Background: Severe fear of childbirth (FOC) is the most important cause of elective and emergency cesarean section and results in an unpleasant experience among women. Implementing a psychoeducational program can promote mothers’ knowledge and reduce the FOC. Aim: the aim of this study was to determine the effect of childbirth psychoeducational program on the FOC intensity in primigravid women....

متن کامل

Hypertext and the Scholarly Archive

W ith th e W eb , h y per text h as b ecom e the p ar ad igm atic r h eto rical s tr u ctu r e o f a g lo bal and d istr ibu ted ar ch iv e. Th is pap er ar g ues that th e s ch o lar ly ar ch iv e is g o ing tho u g h a p r ocess o f h y p ertextu alization that is n o t ad eq uately acco u n ted f o r in th eo r ies on h y per text. A m eth od o log ical ap p ro ach b ased o n G er ar d Gen e...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1308.2696  شماره 

صفحات  -

تاریخ انتشار 2013